Overview

Dataset statistics

Number of variables: 40
Number of observations: 7679
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.3 MiB
Average record size in memory: 320.0 B
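
The two memory figures above agree with each other under a simple assumption that the report itself does not state: each of the 40 variables is stored as one 8-byte value per row. A minimal arithmetic check in Python, using only numbers from the dataset statistics:

```python
# Figures taken from the dataset statistics above.
N_VARIABLES = 40
N_OBSERVATIONS = 7679

# Assumption (not stated in the report): every cell occupies 8 bytes,
# e.g. a 64-bit number or a 64-bit object reference.
BYTES_PER_VALUE = 8

record_size_bytes = N_VARIABLES * BYTES_PER_VALUE
total_bytes = record_size_bytes * N_OBSERVATIONS
total_mib = total_bytes / 2**20  # 1 MiB = 2**20 bytes

print(f"{record_size_bytes} B per record, {total_mib:.1f} MiB in total")
```

Under that assumption the result matches the reported 320.0 B per record and 2.3 MiB in total; any index overhead is ignored here.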

Variable types

Numeric: 19
Categorical: 21

Warnings

Row has constant value "0" Constant
bin has constant value "0" Constant
run has constant value "0" Constant
media has constant value "0" Constant
sys has constant value "0" Constant
usr has constant value "0" Constant
root has constant value "0" Constant
host has constant value "0" Constant
brk has constant value "0" Constant
boot has constant value "0" Constant
mkdir has constant value "0" Constant
write has constant value "0" Constant
read has constant value "0" Constant
kill_API has constant value "0" Constant
mnt has constant value "0" Constant
opt has constant value "0" Constant
rename has constant value "0" Constant
srv has constant value "0" Constant
udp is highly correlated with connect High correlation
poll is highly correlated with etc and 9 other fields High correlation
etc is highly correlated with poll and 9 other fields High correlation
connect is highly correlated with udp High correlation
tcp is highly correlated with poll and 9 other fields High correlation
clone_API is highly correlated with mmap and 1 other field High correlation
var is highly correlated with poll and 9 other fields High correlation
network_http is highly correlated with poll and 9 other fields High correlation
File_IO is highly correlated with poll and 9 other fields High correlation
accept is highly correlated with poll and 9 other fields High correlation
munmap is highly correlated with poll and 9 other fields High correlation
mmap is highly correlated with poll and 11 other fields High correlation
chdir_API is highly correlated with poll and 9 other fields High correlation
network_connection is highly correlated with poll and 9 other fields High correlation
execve_API is highly correlated with clone_API and 1 other field High correlation
Normal is highly correlated with Attack High correlation
Attack is highly correlated with Normal High correlation
mnt is highly correlated with Attack and 19 other fields High correlation
Attack is highly correlated with mnt and 18 other fields High correlation
sys is highly correlated with mnt and 19 other fields High correlation
boot is highly correlated with mnt and 19 other fields High correlation
rename is highly correlated with mnt and 19 other fields High correlation
opt is highly correlated with mnt and 19 other fields High correlation
srv is highly correlated with mnt and 19 other fields High correlation
host is highly correlated with mnt and 19 other fields High correlation
media is highly correlated with mnt and 19 other fields High correlation
usr is highly correlated with mnt and 19 other fields High correlation
kill_API is highly correlated with mnt and 19 other fields High correlation
write is highly correlated with mnt and 19 other fields High correlation
brk is highly correlated with mnt and 19 other fields High correlation
root is highly correlated with mnt and 19 other fields High correlation
mkdir is highly correlated with mnt and 19 other fields High correlation
Row is highly correlated with mnt and 19 other fields High correlation
home is highly correlated with mnt and 17 other fields High correlation
bin is highly correlated with mnt and 19 other fields High correlation
run is highly correlated with mnt and 19 other fields High correlation
Normal is highly correlated with mnt and 18 other fields High correlation
read is highly correlated with mnt and 19 other fields High correlation
proc is highly skewed (γ1 = 25.09061791) Skewed
udp has 7078 (92.2%) zeros Zeros
poll has 110 (1.4%) zeros Zeros
dev has 6821 (88.8%) zeros Zeros
etc has 1002 (13.0%) zeros Zeros
connect has 7128 (92.8%) zeros Zeros
tcp has 178 (2.3%) zeros Zeros
clone_API has 4909 (63.9%) zeros Zeros
proc has 7662 (99.8%) zeros Zeros
var has 643 (8.4%) zeros Zeros
network_http has 642 (8.4%) zeros Zeros
File_IO has 643 (8.4%) zeros Zeros
accept has 279 (3.6%) zeros Zeros
munmap has 1002 (13.0%) zeros Zeros
mmap has 1002 (13.0%) zeros Zeros
chdir_API has 644 (8.4%) zeros Zeros
network_connection has 110 (1.4%) zeros Zeros
execve_API has 4909 (63.9%) zeros Zeros
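
The "Constant" and "Zeros" warnings above reduce to two simple per-column checks. The sketch below shows one way to reproduce them with the Python standard library only; it is not pandas-profiling's own implementation, the function name `find_warnings` is hypothetical, and the toy values are made up rather than taken from the profiled dataset.

```python
from typing import Dict, List, Tuple


def find_warnings(columns: Dict[str, List[float]]) -> Tuple[List[str], Dict[str, float]]:
    """Return (names of constant columns, fraction of zeros per non-constant column)."""
    constant: List[str] = []
    zero_fraction: Dict[str, float] = {}
    for name, values in columns.items():
        if not values:
            continue  # nothing to report for an empty column
        if len(set(values)) == 1:
            # A single distinct value corresponds to a "Constant" warning.
            constant.append(name)
        else:
            # Share of zeros, as reported in the "Zeros" warnings.
            zero_fraction[name] = sum(1 for v in values if v == 0) / len(values)
    return constant, zero_fraction


# Toy data (hypothetical values, not the real dataset).
toy = {
    "mnt": [0, 0, 0, 0],
    "udp": [0, 0, 0, 27],
    "select": [44, 42, 40, 44],
}
constant, zero_fraction = find_warnings(toy)
print(constant)
print(zero_fraction)

# The percentage in "udp has 7078 (92.2%) zeros" is 7078 out of 7679 observations.
udp_zero_share = f"{7078 / 7679:.1%}"
print(udp_zero_share)
```

Columns flagged as constant carry no information for a model and are marked REJECTED in the variable sections of this report.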

Reproduction

Analysis started: 2021-03-08 11:40:26.175939
Analysis finished: 2021-03-08 11:41:24.389934
Duration: 58.21 seconds
Software version: pandas-profiling v2.11.0
Download configuration: config.yaml

Variables

df_index
Real number (ℝ≥0)

Distinct: 2880
Distinct (%): 37.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1291.246386
Minimum: 0
Maximum: 2879
Zeros: 3
Zeros (%): < 0.1%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 127.9
Q1: 639.5
Median: 1279
Q3: 1919
95-th percentile: 2507.1
Maximum: 2879
Range: 2879
Interquartile range (IQR): 1279.5

Descriptive statistics

Standard deviation: 759.5275562
Coefficient of variation (CV): 0.5882127255
Kurtosis: -1.07187065
Mean: 1291.246386
Median Absolute Deviation (MAD): 640
Skewness: 0.09190132639
Sum: 9915481
Variance: 576882.1086
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
2047                  3        < 0.1%
881                   3        < 0.1%
909                   3        < 0.1%
905                   3        < 0.1%
901                   3        < 0.1%
897                   3        < 0.1%
893                   3        < 0.1%
889                   3        < 0.1%
885                   3        < 0.1%
877                   3        < 0.1%
Other values (2870)   7649     99.6%

Value                 Count    Frequency (%)
0                     3        < 0.1%
1                     3        < 0.1%
2                     3        < 0.1%
3                     3        < 0.1%
4                     3        < 0.1%

Value                 Count    Frequency (%)
2879                  1        < 0.1%
2878                  1        < 0.1%
2877                  1        < 0.1%
2876                  1        < 0.1%
2875                  1        < 0.1%
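
Several of the df_index statistics above are derived from one another: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the IQR and range are differences of quantiles. A short Python check using only the reported figures (the variable names are illustrative):

```python
# Reported values for df_index, copied from the statistics above.
n_observations = 7679
total = 9915481
std = 759.5275562
q1, q3 = 639.5, 1919
minimum, maximum = 0, 2879

mean = total / n_observations   # reported: 1291.246386
cv = std / mean                 # reported: 0.5882127255
variance = std ** 2             # reported: 576882.1086
iqr = q3 - q1                   # reported: 1279.5
value_range = maximum - minimum # reported: 2879

print(mean, cv, variance, iqr, value_range)
```

The recomputed values agree with the report up to the rounding of the printed standard deviation.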

Row
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

udp
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 164
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.32621435
Minimum: 0
Maximum: 1313
Zeros: 7078
Zeros (%): 92.2%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 108
Maximum: 1313
Range: 1313
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 89.36345168
Coefficient of variation (CV): 4.623950146
Kurtosis: 32.93271225
Mean: 19.32621435
Median Absolute Deviation (MAD): 0
Skewness: 5.411186197
Sum: 148406
Variance: 7985.826496
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
0                     7078     92.2%
27                    61       0.8%
324                   43       0.6%
216                   42       0.5%
162                   37       0.5%
378                   37       0.5%
9                     28       0.4%
270                   26       0.3%
4                     24       0.3%
10                    23       0.3%
Other values (154)    280      3.6%

Value                 Count    Frequency (%)
0                     7078     92.2%
3                     3        < 0.1%
4                     24       0.3%
5                     17       0.2%
6                     6        0.1%

Value                 Count    Frequency (%)
1313                  1        < 0.1%
994                   1        < 0.1%
906                   1        < 0.1%
844                   1        < 0.1%
791                   1        < 0.1%

select
Real number (ℝ≥0)

Distinct: 54
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 41.69696575
Minimum: 0
Maximum: 88
Zeros: 2
Zeros (%): < 0.1%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 38
Q1: 40
Median: 42
Q3: 44
95-th percentile: 46
Maximum: 88
Range: 88
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.776544315
Coefficient of variation (CV): 0.09057120217
Kurtosis: 23.5667316
Mean: 41.69696575
Median Absolute Deviation (MAD): 2
Skewness: -2.316131686
Sum: 320191
Variance: 14.26228696
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
44                    2284     29.7%
42                    2062     26.9%
40                    1610     21.0%
38                    416      5.4%
39                    230      3.0%
48                    213      2.8%
41                    177      2.3%
46                    141      1.8%
43                    116      1.5%
36                    92       1.2%
Other values (44)     338      4.4%

Value                 Count    Frequency (%)
0                     2        < 0.1%
6                     1        < 0.1%
10                    1        < 0.1%
11                    1        < 0.1%
12                    3        < 0.1%

Value                 Count    Frequency (%)
88                    1        < 0.1%
76                    1        < 0.1%
72                    1        < 0.1%
68                    1        < 0.1%
64                    1        < 0.1%

bin
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

run
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

poll
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 613
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 270.4533142
Minimum: 0
Maximum: 2785
Zeros: 110
Zeros (%): 1.4%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 3
Q1: 19
Median: 160
Q3: 380
95-th percentile: 867
Maximum: 2785
Range: 2785
Interquartile range (IQR): 361

Descriptive statistics

Standard deviation: 385.3489373
Coefficient of variation (CV): 1.424826086
Kurtosis: 13.67430323
Mean: 270.4533142
Median Absolute Deviation (MAD): 148
Skewness: 3.439609102
Sum: 2076811
Variance: 148493.8035
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
380                   919      12.0%
154                   879      11.4%
176                   578      7.5%
3                     441      5.7%
418                   384      5.0%
16                    340      4.4%
360                   266      3.5%
152                   190      2.5%
13                    155      2.0%
174                   141      1.8%
Other values (603)    3386     44.1%

Value                 Count    Frequency (%)
0                     110      1.4%
1                     125      1.6%
2                     28       0.4%
3                     441      5.7%
4                     126      1.6%

Value                 Count    Frequency (%)
2785                  1        < 0.1%
2710                  1        < 0.1%
2701                  1        < 0.1%
2680                  1        < 0.1%
2671                  1        < 0.1%

media
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

sys
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

dev
Real number (ℝ≥0)

ZEROS

Distinct: 103
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.7754916
Minimum: 0
Maximum: 191
Zeros: 6821
Zeros (%): 88.8%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 9
Maximum: 191
Range: 191
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 14.54313785
Coefficient of variation (CV): 5.239842141
Kurtosis: 58.13292846
Mean: 2.7754916
Median Absolute Deviation (MAD): 0
Skewness: 7.271622848
Sum: 21313
Variance: 211.5028585
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
0                     6821     88.8%
9                     376      4.9%
2                     198      2.6%
74                    32       0.4%
23                    23       0.3%
83                    19       0.2%
20                    16       0.2%
1                     15       0.2%
26                    14       0.2%
18                    7        0.1%
Other values (93)     158      2.1%

Value                 Count    Frequency (%)
0                     6821     88.8%
1                     15       0.2%
2                     198      2.6%
3                     1        < 0.1%
5                     3        < 0.1%

Value                 Count    Frequency (%)
191                   1        < 0.1%
180                   1        < 0.1%
168                   1        < 0.1%
167                   1        < 0.1%
165                   2        < 0.1%

etc
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 620
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 892.6608933
Minimum: 0
Maximum: 11270
Zeros: 1002
Zeros (%): 13.0%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 36
Median: 252
Q3: 1275
95-th percentile: 4152
Maximum: 11270
Range: 11270
Interquartile range (IQR): 1239

Descriptive statistics

Standard deviation: 1594.139622
Coefficient of variation (CV): 1.785828901
Kurtosis: 15.3953502
Mean: 892.6608933
Median Absolute Deviation (MAD): 252
Skewness: 3.694304316
Sum: 6854743
Variance: 2541281.135
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
252                   1249     16.3%
0                     1002     13.0%
36                    992      12.9%
288                   823      10.7%
1230                  557      7.3%
72                    297      3.9%
1353                  278      3.6%
1365                  175      2.3%
216                   172      2.2%
1275                  143      1.9%
Other values (610)    1991     25.9%

Value                 Count    Frequency (%)
0                     1002     13.0%
18                    1        < 0.1%
36                    992      12.9%
46                    1        < 0.1%
56                    1        < 0.1%

Value                 Count    Frequency (%)
11270                 1        < 0.1%
11126                 1        < 0.1%
11110                 1        < 0.1%
10928                 1        < 0.1%
10792                 1        < 0.1%

usr
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

root
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

host
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

brk
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

boot
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

mkdir
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

write
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

connect
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 141
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.687329079
Minimum: 0
Maximum: 399
Zeros: 7128
Zeros (%): 92.8%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 32.4
Maximum: 399
Range: 399
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 35.75215376
Coefficient of variation (CV): 5.346253091
Kurtosis: 55.11622371
Mean: 6.687329079
Median Absolute Deviation (MAD): 0
Skewness: 7.08418842
Sum: 51352
Variance: 1278.216499
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
0                     7128     92.8%
12                    108      1.4%
72                    44       0.6%
48                    44       0.6%
36                    38       0.5%
84                    36       0.5%
4                     31       0.4%
60                    26       0.3%
96                    18       0.2%
24                    15       0.2%
Other values (131)    191      2.5%

Value                 Count    Frequency (%)
0                     7128     92.8%
1                     1        < 0.1%
4                     31       0.4%
6                     1        < 0.1%
8                     2        < 0.1%

Value                 Count    Frequency (%)
399                   1        < 0.1%
377                   1        < 0.1%
376                   2        < 0.1%
374                   1        < 0.1%
372                   1        < 0.1%

tcp
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 1075
Distinct (%): 14.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2994.402396
Minimum: 0
Maximum: 27890
Zeros: 178
Zeros (%): 2.3%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 4
Q1: 159
Median: 1569
Q3: 4580
95-th percentile: 10679.9
Maximum: 27890
Range: 27890
Interquartile range (IQR): 4421

Descriptive statistics

Standard deviation: 4188.827966
Coefficient of variation (CV): 1.398886125
Kurtosis: 11.02219724
Mean: 2994.402396
Median Absolute Deviation (MAD): 1425
Skewness: 3.104270661
Sum: 22994016
Variance: 17546279.73
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
4580                  802      10.4%
1533                  671      8.7%
1752                  415      5.4%
4                     393      5.1%
153                   328      4.3%
5038                  304      4.0%
4440                  251      3.3%
4574                  180      2.3%
0                     178      2.3%
1520                  160      2.1%
Other values (1065)   3997     52.1%

Value                 Count    Frequency (%)
0                     178      2.3%
2                     71       0.9%
4                     393      5.1%
6                     1        < 0.1%
8                     4        0.1%

Value                 Count    Frequency (%)
27890                 1        < 0.1%
27068                 1        < 0.1%
27058                 1        < 0.1%
26687                 1        < 0.1%
26649                 1        < 0.1%

clone_API
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 417
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 104.8107827
Minimum: 0
Maximum: 2389
Zeros: 4909
Zeros (%): 63.9%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 99
95-th percentile: 666.3
Maximum: 2389
Range: 2389
Interquartile range (IQR): 99

Descriptive statistics

Standard deviation: 295.0551487
Coefficient of variation (CV): 2.815122082
Kurtosis: 18.56287496
Mean: 104.8107827
Median Absolute Deviation (MAD): 0
Skewness: 4.282169809
Sum: 804842
Variance: 87057.54077
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
0                     4909     63.9%
90                    584      7.6%
99                    340      4.4%
117                   222      2.9%
108                   161      2.1%
126                   159      2.1%
189                   145      1.9%
135                   80       1.0%
198                   80       1.0%
144                   70       0.9%
Other values (407)    929      12.1%

Value                 Count    Frequency (%)
0                     4909     63.9%
6                     24       0.3%
12                    4        0.1%
15                    3        < 0.1%
18                    7        0.1%

Value                 Count    Frequency (%)
2389                  1        < 0.1%
2101                  1        < 0.1%
2052                  1        < 0.1%
2014                  1        < 0.1%
2010                  1        < 0.1%

proc
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.07722359682
Minimum: 0
Maximum: 48
Zeros: 7662
Zeros (%): 99.8%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 48
Range: 48
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.816511575
Coefficient of variation (CV): 23.52275275
Kurtosis: 644.1884744
Mean: 0.07722359682
Median Absolute Deviation (MAD): 0
Skewness: 25.09061791
Sum: 593
Variance: 3.299714302
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)

Value                 Count    Frequency (%)
0                     7662     99.8%
48                    10       0.1%
27                    2        < 0.1%
14                    2        < 0.1%
5                     2        < 0.1%
21                    1        < 0.1%

Value                 Count    Frequency (%)
0                     7662     99.8%
5                     2        < 0.1%
14                    2        < 0.1%
21                    1        < 0.1%
27                    2        < 0.1%

Value                 Count    Frequency (%)
48                    10       0.1%
27                    2        < 0.1%
21                    1        < 0.1%
14                    2        < 0.1%
5                     2        < 0.1%
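
Because proc has only six distinct values, its skewness (γ1 = 25.09061791) can be recomputed directly from the value counts listed above. The sketch below assumes the report uses the bias-adjusted Fisher-Pearson coefficient, as pandas' `Series.skew` does; the report does not name the estimator, so treat that as an assumption.

```python
import math

# Value -> count, read from the proc frequency table above.
counts = {0: 7662, 48: 10, 27: 2, 14: 2, 5: 2, 21: 1}

n = sum(counts.values())
mean = sum(value * count for value, count in counts.items()) / n

# Second and third central moments (population form, divided by n).
m2 = sum(count * (value - mean) ** 2 for value, count in counts.items()) / n
m3 = sum(count * (value - mean) ** 3 for value, count in counts.items()) / n

# Unadjusted skewness g1, then the bias adjustment (assumed estimator).
g1 = m3 / m2 ** 1.5
adjusted_skewness = g1 * math.sqrt(n * (n - 1)) / (n - 2)

# Sample variance (divided by n - 1), for comparison with the reported variance.
sample_variance = m2 * n / (n - 1)

print(n, mean, sample_variance, adjusted_skewness)
```

The extreme skew comes from ten observations at 48 against 7662 zeros: a handful of large values dominates the third moment.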

read
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 60.1 KiB
0: 7679

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 7679
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Value                 Count    Frequency (%)
0                     7679     100.0%

Histogram of lengths of the category
Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring characters

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring categories

Value                 Count    Frequency (%)
Decimal Number        7679     100.0%

Most frequent character per category

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Common                7679     100.0%

Most frequent character per script

Value                 Count    Frequency (%)
0                     7679     100.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 7679     100.0%

Most frequent character per block

Value                 Count    Frequency (%)
0                     7679     100.0%

var
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 689
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 782.5147806
Minimum: 0
Maximum: 7790
Zeros: 643
Zeros (%): 8.4%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 45
Median: 231
Q3: 1340
95-th percentile: 3175.4
Maximum: 7790
Range: 7790
Interquartile range (IQR): 1295

Descriptive statistics

Standard deviation: 1215.956737
Coefficient of variation (CV): 1.553908971
Kurtosis: 10.10999238
Mean: 782.5147806
Median Absolute Deviation (MAD): 219
Skewness: 2.990883718
Sum: 6008931
Variance: 1478550.787
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
231                   1227     16.0%
264                   800      10.4%
0                     643      8.4%
24                    546      7.1%
1340                  431      5.6%
12                    421      5.5%
45                    339      4.4%
1170                  247      3.2%
1474                  237      3.1%
198                   167      2.2%
Other values (679)    2621     34.1%

Value                 Count    Frequency (%)
0                     643      8.4%
11                    5        0.1%
12                    421      5.5%
23                    15       0.2%
24                    546      7.1%

Value                 Count    Frequency (%)
7790                  1        < 0.1%
7694                  1        < 0.1%
7584                  1        < 0.1%
7376                  1        < 0.1%
7354                  1        < 0.1%

network_http
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 280
Distinct (%): 3.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 43.90532621
Minimum: 0
Maximum: 461
Zeros: 642
Zeros (%): 8.4%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 4
Median: 14
Q3: 80
95-th percentile: 146
Maximum: 461
Range: 461
Interquartile range (IQR): 76

Descriptive statistics

Standard deviation: 62.76364468
Coefficient of variation (CV): 1.42952234
Kurtosis: 9.742482377
Mean: 43.90532621
Median Absolute Deviation (MAD): 12
Skewness: 2.839614226
Sum: 337149
Variance: 3939.275094
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
14                    1226     16.0%
80                    1106     14.4%
4                     878      11.4%
16                    802      10.4%
0                     642      8.4%
2                     429      5.6%
88                    399      5.2%
60                    305      4.0%
8                     217      2.8%
12                    209      2.7%
Other values (270)    1466     19.1%

Value                 Count    Frequency (%)
0                     642      8.4%
1                     18       0.2%
2                     429      5.6%
3                     41       0.5%
4                     878      11.4%

Value                 Count    Frequency (%)
461                   1        < 0.1%
437                   1        < 0.1%
430                   1        < 0.1%
419                   1        < 0.1%
413                   1        < 0.1%

File_IO
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 881
Distinct (%): 11.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2200.044146
Minimum: 0
Maximum: 23052
Zeros: 643
Zeros (%): 8.4%
Memory size: 60.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 130
Median: 602
Q3: 3290
95-th percentile: 10994
Maximum: 23052
Range: 23052
Interquartile range (IQR): 3160

Descriptive statistics

Standard deviation: 3751.367907
Coefficient of variation (CV): 1.705133014
Kurtosis: 11.89239048
Mean: 2200.044146
Median Absolute Deviation (MAD): 558
Skewness: 3.318102842
Sum: 16894139
Variance: 14072761.17
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value                 Count    Frequency (%)
602                   1227     16.0%
688                   800      10.4%
0                     643      8.4%
94                    412      5.4%
115                   339      4.4%
3170                  300      3.9%
3000                  244      3.2%
44                    176      2.3%
516                   167      2.2%
3487                  154      2.0%
Other values (871)    3217     41.9%

Value                 Count    Frequency (%)
0                     643      8.4%
28                    2        < 0.1%
29                    118      1.5%
43                    1        < 0.1%
44                    176      2.3%

Value                 Count    Frequency (%)
23052                 1        < 0.1%
22672                 1        < 0.1%
22574                 1        < 0.1%
22509                 1        < 0.1%
22403                 1        < 0.1%

kill_API
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
0
7679 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters7679
Distinct characters1
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
07679
100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
07679
100.0%

Most occurring characters

ValueCountFrequency (%)
07679
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7679
100.0%

Most frequent character per category

ValueCountFrequency (%)
07679
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common7679
100.0%

Most frequent character per script

ValueCountFrequency (%)
07679
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII7679
100.0%

Most frequent character per block

ValueCountFrequency (%)
07679
100.0%

accept
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct369
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.38572731
Minimum0
Maximum756
Zeros279
Zeros (%)3.6%
Memory size60.1 KiB

Quantile statistics

Minimum0
5-th percentile1
Q14
median44
Q3100
95-th percentile217.2
Maximum756
Range756
Interquartile range (IQR)96

Descriptive statistics

Standard deviation95.71877458
Coefficient of variation (CV)1.399689356
Kurtosis14.07980542
Mean68.38572731
Median Absolute Deviation (MAD)42
Skewness3.441699688
Sum525134
Variance9162.083808
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1001103
14.4%
42702
 
9.1%
1684
 
8.9%
3597
 
7.8%
110399
 
5.2%
48391
 
5.1%
41346
 
4.5%
90346
 
4.5%
2340
 
4.4%
47300
 
3.9%
Other values (359)2471
32.2%
ValueCountFrequency (%)
0279
3.6%
1684
8.9%
2340
4.4%
3597
7.8%
4164
 
2.1%
ValueCountFrequency (%)
7561
< 0.1%
7391
< 0.1%
7182
< 0.1%
7021
< 0.1%
7001
< 0.1%

home
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
0
7677 
12
 
1
48
 
1

Length

Max length2
Median length1
Mean length1.000260451
Min length1

Characters and Unicode

Total characters7681
Distinct characters5
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
07677
> 99.9%
121
 
< 0.1%
481
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
07677
> 99.9%
481
 
< 0.1%
121
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
07677
99.9%
41
 
< 0.1%
81
 
< 0.1%
11
 
< 0.1%
21
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7681
100.0%

Most frequent character per category

ValueCountFrequency (%)
07677
99.9%
41
 
< 0.1%
81
 
< 0.1%
11
 
< 0.1%
21
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common7681
100.0%

Most frequent character per script

ValueCountFrequency (%)
07677
99.9%
41
 
< 0.1%
81
 
< 0.1%
11
 
< 0.1%
21
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII7681
100.0%

Most frequent character per block

ValueCountFrequency (%)
07677
99.9%
41
 
< 0.1%
81
 
< 0.1%
11
 
< 0.1%
21
 
< 0.1%

munmap
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct506
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean229.6476104
Minimum0
Maximum2862
Zeros1002
Zeros (%)13.0%
Memory size60.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q18
median56
Q3318
95-th percentile1312.1
Maximum2862
Range2862
Interquartile range (IQR)310

Descriptive statistics

Standard deviation429.9433982
Coefficient of variation (CV)1.872187555
Kurtosis13.5237917
Mean229.6476104
Median Absolute Deviation (MAD)56
Skewness3.564770303
Sum1763464
Variance184851.3257
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
561252
16.3%
01002
13.0%
8992
12.9%
64823
10.7%
300557
 
7.3%
16297
 
3.9%
330292
 
3.8%
354184
 
2.4%
348178
 
2.3%
48172
 
2.2%
Other values (496)1930
25.1%
ValueCountFrequency (%)
01002
13.0%
41
 
< 0.1%
8992
12.9%
121
 
< 0.1%
16297
 
3.9%
ValueCountFrequency (%)
28621
< 0.1%
28341
< 0.1%
28331
< 0.1%
27911
< 0.1%
27471
< 0.1%

mnt
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
0
7679 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters7679
Distinct characters1
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
07679
100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
07679
100.0%

Most occurring characters

ValueCountFrequency (%)
07679
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7679
100.0%

Most frequent character per category

ValueCountFrequency (%)
07679
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common7679
100.0%

Most frequent character per script

ValueCountFrequency (%)
07679
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII7679
100.0%

Most frequent character per block

ValueCountFrequency (%)
07679
100.0%

opt
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
0
7679 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters7679
Distinct characters1
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
07679
100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
07679
100.0%

Most occurring characters

ValueCountFrequency (%)
07679
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7679
100.0%

Most frequent character per category

ValueCountFrequency (%)
07679
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common7679
100.0%

Most frequent character per script

ValueCountFrequency (%)
07679
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII7679
100.0%

Most frequent character per block

ValueCountFrequency (%)
07679
100.0%

rename
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
0
7679 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters7679
Distinct characters1
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
07679
100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
07679
100.0%

Most occurring characters

ValueCountFrequency (%)
07679
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7679
100.0%

Most frequent character per category

ValueCountFrequency (%)
07679
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common7679
100.0%

Most frequent character per script

ValueCountFrequency (%)
07679
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII7679
100.0%

Most frequent character per block

ValueCountFrequency (%)
07679
100.0%

srv
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
0
7679 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters7679
Distinct characters1
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
07679
100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
07679
100.0%

Most occurring characters

ValueCountFrequency (%)
07679
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7679
100.0%

Most frequent character per category

ValueCountFrequency (%)
07679
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common7679
100.0%

Most frequent character per script

ValueCountFrequency (%)
07679
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII7679
100.0%

Most frequent character per block

ValueCountFrequency (%)
07679
100.0%

mmap
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct614
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean588.3469202
Minimum0
Maximum7873
Zeros1002
Zeros (%)13.0%
Memory size60.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q18
median56
Q3726
95-th percentile4128.5
Maximum7873
Range7873
Interquartile range (IQR)718

Descriptive statistics

Standard deviation1259.721482
Coefficient of variation (CV)2.141120211
Kurtosis12.0644703
Mean588.3469202
Median Absolute Deviation (MAD)56
Skewness3.481132565
Sum4517916
Variance1586898.212
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
561250
16.3%
01002
13.0%
8992
12.9%
64823
10.7%
660557
 
7.3%
16297
 
3.9%
726277
 
3.6%
1146176
 
2.3%
48172
 
2.2%
822144
 
1.9%
Other values (604)1989
25.9%
ValueCountFrequency (%)
01002
13.0%
41
 
< 0.1%
8992
12.9%
16297
 
3.9%
201
 
< 0.1%
ValueCountFrequency (%)
78731
< 0.1%
76661
< 0.1%
75851
< 0.1%
75701
< 0.1%
74711
< 0.1%

chdir_API
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct352
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean81.25680427
Minimum0
Maximum1139
Zeros644
Zeros (%)8.4%
Memory size60.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q18
median28
Q3120
95-th percentile292
Maximum1139
Range1139
Interquartile range (IQR)112

Descriptive statistics

Standard deviation135.4666731
Coefficient of variation (CV)1.667142516
Kurtosis15.87619318
Mean81.25680427
Median Absolute Deviation (MAD)24
Skewness3.738917399
Sum623971
Variance18351.21953
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1201431
18.6%
281231
16.0%
8907
11.8%
32809
10.5%
0644
8.4%
132540
 
7.0%
4449
 
5.8%
16227
 
3.0%
24211
 
2.7%
12121
 
1.6%
Other values (342)1109
14.4%
ValueCountFrequency (%)
0644
8.4%
22
 
< 0.1%
4449
5.8%
66
 
0.1%
8907
11.8%
ValueCountFrequency (%)
11391
< 0.1%
10591
< 0.1%
10561
< 0.1%
10391
< 0.1%
10171
< 0.1%

network_connection
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct1196
Distinct (%)15.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3174.038026
Minimum0
Maximum29458
Zeros110
Zeros (%)1.4%
Memory size60.1 KiB

Quantile statistics

Minimum0
5-th percentile4
Q1169
median1626
Q34826
95-th percentile11328.7
Maximum29458
Range29458
Interquartile range (IQR)4657

Descriptive statistics

Standard deviation4486.939295
Coefficient of variation (CV)1.413637536
Kurtosis11.22069377
Mean3174.038026
Median Absolute Deviation (MAD)1474
Skewness3.133876336
Sum24373438
Variance20132624.24
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1589671
 
8.7%
4417
 
5.4%
1816415
 
5.4%
161328
 
4.3%
4820244
 
3.2%
4680203
 
2.6%
1576160
 
2.1%
157147
 
1.9%
159144
 
1.9%
14134
 
1.7%
Other values (1186)4816
62.7%
ValueCountFrequency (%)
0110
 
1.4%
271
 
0.9%
4417
5.4%
51
 
< 0.1%
61
 
< 0.1%
ValueCountFrequency (%)
294581
< 0.1%
288441
< 0.1%
286561
< 0.1%
284071
< 0.1%
283871
< 0.1%

execve_API
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct373
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.47415028
Minimum0
Maximum1146
Zeros4909
Zeros (%)63.9%
Memory size60.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q333
95-th percentile301.1
Maximum1146
Range1146
Interquartile range (IQR)33

Descriptive statistics

Standard deviation144.6021509
Coefficient of variation (CV)3.045913409
Kurtosis20.85290555
Mean47.47415028
Median Absolute Deviation (MAD)0
Skewness4.50729672
Sum364554
Variance20909.78206
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
04909
63.9%
30586
 
7.6%
33287
 
3.7%
48185
 
2.4%
84180
 
2.3%
42146
 
1.9%
51101
 
1.3%
9088
 
1.1%
3683
 
1.1%
9371
 
0.9%
Other values (363)1043
 
13.6%
ValueCountFrequency (%)
04909
63.9%
324
 
0.3%
67
 
0.1%
71
 
< 0.1%
97
 
0.1%
ValueCountFrequency (%)
11461
< 0.1%
11371
< 0.1%
10951
< 0.1%
10711
< 0.1%
10661
< 0.1%

Normal
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
0
5400 
1
2279 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters7679
Distinct characters2
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
05400
70.3%
12279
29.7%
Histogram of lengths of the category
ValueCountFrequency (%)
05400
70.3%
12279
29.7%

Most occurring characters

ValueCountFrequency (%)
05400
70.3%
12279
29.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7679
100.0%

Most frequent character per category

ValueCountFrequency (%)
05400
70.3%
12279
29.7%

Most occurring scripts

ValueCountFrequency (%)
Common7679
100.0%

Most frequent character per script

ValueCountFrequency (%)
05400
70.3%
12279
29.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII7679
100.0%

Most frequent character per block

ValueCountFrequency (%)
05400
70.3%
12279
29.7%

Attack
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
1
5400 
0
2279 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters7679
Distinct characters2
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
15400
70.3%
02279
29.7%
Histogram of lengths of the category
ValueCountFrequency (%)
15400
70.3%
02279
29.7%

Most occurring characters

ValueCountFrequency (%)
15400
70.3%
02279
29.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number7679
100.0%

Most frequent character per category

ValueCountFrequency (%)
15400
70.3%
02279
29.7%

Most occurring scripts

ValueCountFrequency (%)
Common7679
100.0%

Most frequent character per script

ValueCountFrequency (%)
15400
70.3%
02279
29.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII7679
100.0%

Most frequent character per block

ValueCountFrequency (%)
15400
70.3%
02279
29.7%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line does not affect r; only the sign of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
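The calculation described above can be sketched in a few lines of plain Python. This is an illustration only, not the code the report generator runs; the function name `pearson_r` is hypothetical. Returning NaN for a zero-variance input is an assumption of this sketch; it reflects the fact that r is undefined for the constant columns flagged in this report.

```python
import math

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of deviations are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if dev_x == 0 or dev_y == 0:
        return float("nan")  # assumption: constant input -> undefined r
    return cov / (dev_x * dev_y)

# Shifting and rescaling one variable leaves r unchanged.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]
r_original = pearson_r(xs, ys)
r_rescaled = pearson_r([10 * x + 3 for x in xs], ys)
```

Here `r_original` and `r_rescaled` agree up to floating-point error, which is the location and scale invariance mentioned above.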

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
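As a rough sketch of that recipe: rank each variable, then apply the Pearson formula to the ranks. The helper names `average_ranks` and `spearman_rho` are hypothetical, and giving tied values the average of their ranks is an assumption of this sketch (a common convention, relevant here because the columns in this report contain many repeated values).

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def _pearson(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    dx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    dy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return float("nan") if dx == 0 or dy == 0 else cov / (dx * dy)

def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables of xs and ys."""
    return _pearson(average_ranks(xs), average_ranks(ys))

# A monotonic but nonlinear relationship: rho is 1 while Pearson's r is below 1.
xs = [1, 2, 3, 4, 5]
ys = [x ** 2 for x in xs]
rho = spearman_rho(xs, ys)
r = _pearson(xs, ys)
```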

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
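The pair-counting definition can be sketched directly. The function name `kendall_tau_a` is hypothetical. Note that this literal form (often called tau-a) counts tied pairs as neither concordant nor discordant while keeping them in the denominator; statistical libraries commonly report a tie-adjusted variant (tau-b), so values may differ from the report's on columns with many ties. The loop is quadratic in the number of observations, which is fine for illustration.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move the same way
        elif direction < 0:
            discordant += 1   # the variables move in opposite ways
        # direction == 0 is a tie in x or y and counts for neither side
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Three observations: two concordant pairs, one discordant pair.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
```

In this small example `tau` works out to (2 - 1) / 3.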

Phik (φk)

Phik (φk) is a correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
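A minimal sketch of a bias-corrected Cramér's V follows, using the correction as it is commonly stated for Bergsma's 2013 proposal: the squared phi coefficient and the table dimensions are each shrunk by a term of order 1/(n-1). The function name `cramers_v_corrected` is hypothetical, and this is not necessarily the exact code the report generator uses.

```python
import math
from collections import Counter

def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V for two nominal sequences of equal length."""
    n = len(xs)
    if n != len(ys):
        raise ValueError("sequences must have equal length")
    row_counts = Counter(xs)
    col_counts = Counter(ys)
    joint_counts = Counter(zip(xs, ys))
    r, k = len(row_counts), len(col_counts)
    if n < 2 or r < 2 or k < 2:
        return float("nan")  # assumption: undefined for constant variables

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for a, row_total in row_counts.items():
        for b, col_total in col_counts.items():
            expected = row_total * col_total / n
            observed = joint_counts.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return float("nan")
    return math.sqrt(phi2_corr / denom)

# Two perfectly complementary binary labels, using the class counts that
# this report lists for the Normal and Attack columns (5400 vs. 2279).
normal = [0] * 5400 + [1] * 2279
attack = [1 - v for v in normal]
v_labels = cramers_v_corrected(normal, attack)
```

Because each label is fully determined by the other, `v_labels` comes out at 1 up to floating-point error, consistent with the high-correlation flags on both columns.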

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

df_indexRowudpselectbinrunpollmediasysdevetcusrroothostbrkbootmkdirwriteconnecttcpclone_APIprocreadvarnetwork_httpFile_IOkill_APIaccepthomemunmapmntoptrenamesrvmmapchdir_APInetwork_connectionexecve_APINormalAttack
0009400034000138000000004061800104536708042000014610503901
1100440050600232228000000024621042748016958579620127072800002222170656913101
220042003600001230000000004440900011706030000900300000066012046803001
330044003600001230000000004434900011706030000900300000066012046743001
440044003600001230000000004428900011706030000900300000066012046683001
550040003600001230000000004428900011706030000900300000066012046683001
660042003600001230000000004440900011706030000900300000066012046803001
770044003600001230000000004440900011706030000900300000066012046803001
880044003600001230000000004440900011706030000900300000066012046803001
990274200372000141000000001244401020011646032790900350000080212047163601

Last rows

df_indexRowudpselectbinrunpollmediasysdevetcusrroothostbrkbootmkdirwriteconnecttcpclone_APIprocreadvarnetwork_httpFile_IOkill_APIaccepthomemunmapmntoptrenamesrvmmapchdir_APInetwork_connectionexecve_APINormalAttack
7669251000440015400025200000000153300023114602042056000056281589001
7670251100440017600028800000000175100026416688046064000064321815001
7671251200440015400025200000000153000023114602041056000056281586001
7672251300400015400025200000000153300023114602042056000056281589001
7673251400440017600028800000000175100026416688047064000064321815001
7674251500400017600028800000000174800026416688047064000064321812001
7675251600380015400025200000000153300023114602042056000056281589001
7676251700380015400025200000000153300023114602042056000056281589001
7677251800400015400025200000000153200023114602041056000056281588001
7678251900400026200054000000000281400051232130706601200000120602934001